Demonstrating quantum algorithm acceleration with NMR quantum computer 
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In general, a quantum circuit is constructed with elementary gates, such as one-qubit gates and 
CNOT gates. It is possible, however, to speed up the execution time of a given circuit by merging 
those elementary gates together into larger modules, such that the desired unitary matrix expressing 
the algorithm is directly implemented. We demonstrate this experimentally by taking the two- 
qubit Grover's algorithm implemented in NMR quantum computation, whose pseudopure state is 
generated by cyclic permutations of the state populations. This is the first exact time-optimal 
solution, to our knowledge, obtained for a self-contained quantum algorithm. 
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I. INTRODUCTION 

A quantum computer is expected to solve some of 
computationally hard problems for a conventional dig- 
ital computer The realization of a practical quan- 
tum computer is, however, still challenging in many re- 
spects 0. One of the obstacles to any realization is a 
phenomenon known as "decoherence" . The number of 
gate operations is severely limited since a quantum state 
is vulnerable due to interactions with the surroundings. 
Many strategies to overcome internal and external deco- 
herence have been proposed, such as (1) quantum error 
correcting codes (2) decoherence-free subspaces Q (3) 
holonomic quantum computation pj, among others. In a 
conventional design of a quantum circuit, the so-called el- 
ementary set of gates such as single-qubit SU(2) 
rotations and CNOT gates, are utilized. This design is 
motivated by the universality theorem proved in Ref. y| . 
Suppose we are to implement an n-qubit unitary matrix 
^target- In the conventional implementation this matrix 
is decomposed into a product of SU{2) matrices acting 
bewteen a pair of basis vectors. Then a CNOT gate 
transforms one of the basis vectors to a new vector, so 
that the pair of the vectors forms a subspace correspond- 
ing to a single qubit on which the SU (2) matrix acts. 

Quantum algorithm acceleration is a totally new ap- 
proach to the decoherence issue. This principle is orig- 
inally proposed in the context of the holonomic quan- 
tum computation [tj llOl ITU and of Josephson junction 
qubits [H d Il4| . What is required to implement is 
not individual elementary gates but rather the n-qubit 
matrix [/target £ U(2 n ) realizing the given quantum algo- 
rithm. This matrix is directly implemented by properly 
choosing the control parameters in the Hamiltonian. The 
variational principle tells us that the gate execution time 
reduces, in general, compared to the conventional con- 
struction since the conventional gate sequence belongs to 
the possible solutions in direct implementation. 



The proposal El EI Q has been made for fictitious 
Josephson charge qubits, which are still beyond reach. It 
is the purpose of this paper to demonstrate the acceler- 
ation of a quantum algorithm using an NMR quantum 
computer at our hand. For this demonstration, we em- 
ploy two-qubit Grover's search algorithm whose initial 
state 1 00) is generated as a pseudopure state by cyclic 
permutations of the state populations |l5j . In this pro- 
cess, we need to prepare three different initial states using 
SU(4) transformations acting on a thermal equilibrium 
ensemble. It is found that the optimized pulse sequence 
reduces the gate operation time to 25% of the conven- 
tional pulse sequence operation time in two ensembles 
while it remains unchanged in one ensemble, which is 
already optimized using the conventional pulse sequence. 

The next section is devoted to the formalism of our 
approach. The Hamiltonian for the two-qubit molecule 
is introduced and the time-evolution operator is defined. 
Section III is the main part of this paper, where the ex- 
act solutions for Grover's algorithm are obtained. These 
solutions are experimentally verified with our NMR com- 
puter. Section IV is devoted to summary and discussion. 



II. TIME-OPTIMAL PATH IN NMR QUANTUM 
COMPUTATION 



In the present paper, we are concerned with an 
NMR quantum computer with a two-qubit heteronucleus 
molecule. To be more specific, we use Carbon-13 labeled 
chloroform as a computational resource throughout our 
theoretical and experimental analyses. The Hamiltonian 
of the molecule is 



H(j) = -Lj n [cos 0i (a x ®/ 2 /2) 

-Ul2 [COS (f> 2 (l2 ® CTx/2) 

+2wJa z ® <7 z /4, 



■ sin <f>i (a y ® h/2)} 

■ sin (f> 2 (l2 <8> o-y/2)] 

(1) 



2 



in the rotating frame of each nucleus. Here I2 is the unit 
matrix of order 2 while is the kth Pauli matrix. The 
parameter luu is the amplitude of the RF pulse for the 
ith spin while (pi is its phase. These four independent 
control parameters are collectively denoted as 7. 
The time-evolution operator 



in a relevant basis and hence 



U[-y(t)]=Texp 



H{i{t))dt 



(2) 



associated with the Hamiltonian (JTJ is a functional of 
7(f), where T is the time-ordered product and we employ 
the natural units in which h is set to unity. Note that 
U[y(t)] £ SU(A) since Q is traceless. Suppose we want 
to implement a unitary gate t/target- Then our task is to 
find a function j(t) such that 



(3) 



The solution of this problem is highly nontrivial and we 
often resort to numerical analysis [H El El El- Note, 
moreover, that we would like to construct time-optimal 
solutions among many possible solutions of Eq. (j3J to 
fight against decoherence. It must also be examined 
whether the optimal solution thus obtained has control 
parameters that are experimentally accessible. 

It was shown in Refs. 0, El that there exists a geo- 
metrical picture for the time-optimal solution in the case 
of NMR quantum computation for two-qubit heteronu- 
clear molecules. Here we summarize briefly the relevant 
aspects of their idea that are required for our investi- 
gation. The crucial observation is that the one-qubit 
rotation is carried out in a negligible time compared to 
that required for two-qubit operation generated by the J- 
coupling term in Eq. Q and hence time required for the 
one-qubit rotation can be neglected in evaluating the gate 
operation time. Then it is natural to consider the homo- 
geneous space SU(4)/K, where K = SU(2) <g> SU(2), in 
which two-qubit gates that differ by one-qubit operations 
Ui ® U2 £ K are identified. The path in the homoge- 
neous space is then generated by the J-coupling term 
in Eq. (Q. A remark is in order. The Hamiltonian 
contains neither a z ® I2 nor I2 <& o z and it seems impossi- 
ble to consider the equivalence class based on the whole 
subgroup K . Note, however, that these terms are easily 
generated by a x and a y as 



-ia.cT z /2 



^i-H(Jy /4g— iQ(T x 1 2 g — ina y /4 



(4) 



for example, and hence it makes sense to identify all the 
elements of SU(4), that differ with each other modulo the 
subgroup K. With the same token, the terms a x ® a x 
and a y ® cF y may be included in the J-coupling part of 



the Hamiltonian since 



-i-Kjtaj; 



xe 



— i-K Jta z ®a z /2 — m(a y ®I 2 +h ®<? y )/4 



(5) 



for example. It should be noted that the set {a x ^ia x , 
c y ,(Jz ® c z } comprises the Cartan subalgebra of SU(A) 



- i (ai <Jx <g>o- x +02 c y 0<j y +as <t z igi a z ) 



Suppose we want to find a time-optimal path j(t) con- 
necting the unit matrix I4 of order four and the target 
matrix [/target € SU(4). The prescription given above 
suggests that we should find the time-optimal path in 
SU(4)/K that connects the equivalence classes K and 



ftarget-ff ■ 

equation 



In other words, we have to solve the matrix 



target 

where K\ , K2 £ K and 



K 2 U J {t)K 1 



Uj{U) 



= e -(<7rJ/2)(t ICTa: 



i&x +*2 Ty ® CTy +t3&z ) 



(6) 



(7) 



The total execution time is then given by T — y^-. tj. 
The group SU(4) is 15-dimensional, while each of the Ki 
has six parameters and Uj(t) has three, thus 2x6+3 = 15 
parameters in total. It follows from Eqs. J3J and (JSJ) that 
the Hamiltonian Q contains the necessary and sufficient 
number of control parameters to construct an arbitrary 
quantum gate. 



III. TIME OPTIMAL SOLUTIONS OF 
GROVER'S ALGORITHM 

A. Grover's Algorithm 

The solution of the decomposition 10 is difficult to 
find and we have to resort to numerical methods in gen- 
eral. As an example, we work out in the present section 
Grover's seach algorithm [18j in the framework outlined 
in the previous section. Grover's algorithm for a two- 
qubit case is implemented by unitary matrices of the form 



U 



U01 






V 



-1 0\ 

1 

1 

1/ 



u w = 





-1 









-1 



-10 



-1 




1 






-1 





-1 






1 
0-100 
0-10 
-10 



(8) 



(9) 



(10) 



(11) 



where Uj picks out the desired "data" (ij) by operating 
on 1 00) as 



Uij |00) = e ia \ij), = 0,1) 



(12) 
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where we have written explicitly the possible phase on 
the rhs [HI. 

An interesting observation is that all the matrix ele- 
ments are either or ±1, which implies that C/y is not 
only in SU(4) but also in SO (4). This further restricts 
the degrees of freedom of the matrix since SO(A) is six- 
dimensional. In fact, our numerical results show that the 
decomposition JSJ takes a highly constrained form. 



sample is in a thermal equilibrium state and we will use 
the temporal averaging by cyclic permutations of state 
populations to obtain a pseudopure initial state. This is 
carried out by applying the unitary operators 

/ 1 \ 

Ucp = 1 = CNOT i2CNOT 21 (17) 
\0 1 0/ 



B. Optimization 



and 



According to the presecription given in the previous 
section, we search a pulse sequence of the form Uij — 
K 2 UjK\. Instead of solving this inverse problem, we 
define the equivalent variational problem which is easy 
to solve, at least numerically. Let 



U[j(t)] = K 2 { l2 )Uj (t)JTi(7i), 



(13) 



where 



KAji) — e i ( aia *+ bl < T y+ Cia '^ (g) e i ( a 2<?z+b2<Ty+C2<?z) ^4) 

and Uj{t) has been given in Eq. (0. Now we define the 
penalty function p[7(t)] as 



p[7W] = ||C/[ 7 W]-C/target||F, 



(15) 



where \\A\\-p — vctATA is the Frobenius norm of a ma- 
trix A. Note that p[7(£)] is positive definite and that its 
zeros are the absolute minima. Thus solving the inverse 
problem is recast into a variational task for finding the 
absolute minima of p["/(t)]. 

To find the absolute minima, we have generated 512 
initial conditions and searched for the optimal solutions 
with the polytope algorithm on a parallel computer with 
512 CPUs. To our surprise, the execution time T = 
U is discrete and assumes the values 



T = (n+ 1)/ J, 



0,1,2. 



(16) 



for all four cases. The ambiguity 2irn corresponds to the 
path leaving from I4 and traversing the compact group 
SU(4:) n times before hitting the destination E/target-^2- 
The result shows that Fig. 1 of 0] is misleading at 
least in the present case: the distance between the cosets 
I4K1 and C/targct-?^ i s unique in the sense that the only 
degree of freedom left is how many times the path tra- 
verses SU(A) before arriving at the target. 

The optimal execution time obtained here, however, 
is the same as that for the conventional pulse sequence 
[TH in all cases. In other words, the conventional pulse 
sequences are already time-optimal. For such a simple al- 
gorithm, time-optimization may be carried out by inspec- 
tion by experts. Therefore, we look for more complicated 
cases to demonstrate the power of this method. Suppose 
we would like to execute Grover's search algorithm with 
a room-temperature liquid state NMR computer. The 



U = 

^ cp 



f 1 
10 
1 

Vo 1 



= CNOT21CNOT12 



(18) 



to the initial thermal state before Uij is executed. Here 
CNOTy stands for the CNOT gate with the control bit 
i and the target bit j. 

We now present our search result for Uio, UioU cp and 
I/loll^, the other ij producing similar results. 

• U\q: An example of a typical time-optimal solution 
for Uio is 



Ki = h®h 



K? 



-i(ir/4)u z 



-<y y ®cy y ) ^g) 
s i(x/2\/2)(cr I .+cr y ) 



The execution time is T = l/J, which simply re- 
produces that for the conventional pulse sequence 
Therefore, the conventional pulse sequence 
is already optimized. We will employ the conven- 
tional pulse sequence for Uio i n the following. 

• UiqU cp : For this case, an example of the time- 
optimal pulse sequence is 



i(7r/4)cr x 



JJ = g— i(7r/4)<7 a { 



(20) 



i(ir/2)cr y i(-K/3V3){a x +(Ty+a :g ) 



,«("/2)<»i 



e i(7r/4)cr ye i(ir/4)o- I 



where the second line of K 2 shows the pulse se- 
quence made of the existing terms in the Hamilto- 
nian. It should be noted that this pulse sequence 
requires the execution time T — 1/2 J in spite of 
an additional gate U cp , which is composed of two 
CNOT gates and costs 1 /J of time to execute in the 
conventional pulse sequence. The execution time 
of the time-optimal pulse sequence is 25% of that 
for the conventional pulse sequence. The execution 
time for the other solutions takes discrete values 
(n+l/2)/J, (n = 0, 1, 2, . . .). This corresponds to 
a path traversing SU (4) n times before arriving at 

the ?7target- 
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UiolI^ p : An example of the time-optimal pulse se- 
quence is 



Uj 
K 2 



-i(7r/3\/3)(CT a .+CT„+CT Ji ) « j 



e -i(7r/4)cr Xe -i(7r/4)(J B 



_ e -i(7r/4)o 



(21) 



Note again that the execution time of the gate for 
the time-optimal pulse sequence is 25% of that for 
the conventional pulse sequence. In general, the 
execution time is (n + l/2)/J, (n = 0, 1, 2, . . .) as 
in the above case. 



The generators in the pulse sequences, that do not ex- 
ist in the Hamiltonian l|T)l. are rewritten in favor of the 
existing terms by making use of the conjugate transfor- 
mations Q and J5J. The results are summarized in Table 
I. Also shown in the Table are the results according to 
the conventional pulse sequence for the respective gate. 



C. Experiments 

In our experiments, we used 0.6 milliliter, 200 mil- 
limolar sample of Carbon-13 labeled chloroform (Cam- 
bridge Isotopes) in d-6 acetone [l^. Data were taken at 
room temperature with a JEOL ECA-500 (the hydro- 
gen Larmor frequency being approximately 500 MHz) 
spectorometer .20]. The measured coupling strength is 
J = 215.5 Hz and the transverse relaxation time T 2 is 
~ 7.5 s for the hydrogen nucleus while ~ 0.30 s for the 
carbon nucleus. The longitudinal relaxation time T\ is 
measured to be ~ 20 s for both nuclei. The spin 1 and 2 
in Table [I] correspond to Carbon-13 and H, respectively. 

Our experimental results are shown in Fig. ^ which 
shows the spectra corresponding to the 1 10} state. The 
spectra in Fig. ^ ( a ) were obtained by using hard pulses 
whose duration is 25 fis for ir/2 pulses. We have inten- 
tionally introduced longer pulses with 250 /its duration to 
see the effect of imperfections, see Fig.Q](b). In the first 
case, we can well ignore the time evolution due to the 
J-coupling while pulses are applied since the characterist 
time for the J-coupling is ~ 1/J ~ 5ms. In the lat- 
ter case, however, six 250 /is pulses amounts to the total 
duration of 1.5ms, which is comparable to 1 /J and we ex- 
pect that the difference in the number of pulses will mani- 
fest. Figuren(b) clearly demonstrates that time-optimal 
pulse sequences produce sharper main peak compared to 
the conventional pulse sequences and less unwanted sig- 
nal, showing the superiority of our solutions. We also 
expect that quantum algorithm accreleration should be 
effective to fight against decoherence. 
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FIG. 1: The spectra for the peak corresponding to 1 10) state. 
Negative signal amplitudes indicate that the carbon nucleus 
is in the state |1), while appearance of the main signal at the 
frequency of 77.5 ppm (instead of 79.2 ppm) implies that the 
hydrogen nucleus is in the |0) state. The insets show the sig- 
nals in the vicinity of 79.2 ppm where a signal may appear if 
the hydrogen nulceus has |1) component. The scales in the 
insets are the same as in the main panels, (a) The upper 
panel shows the spectra obtained with conventional (dotted 
line) and optimized (solid line) pulse sequences. Each n/2- 
pulse duration is set to 25 /is. The main peak produced by 
the optimized pulse sequences is slightly sharper than that of 
the conventional one. (b) The lower panel shows spectra with 
conventional (dotted line) and optimized (solid line) pulse se- 
quences, in which the duration of the 7r/2-pulse is now set 
to 250 /is. The signal produced by the optimized pulse se- 
quences is clearly better than that by the conventional ones. 
Note also that unwanted signal in the inset is weaker for the 
time-optimal pulse sequences. 



IV. CONCLUSIONS AND DISCUSSION 

In summary, we have demonstrated both theoretically 
and experimentally that quantum algorithms may be ac- 
celerated if the unitary matrix realizing an algorithm is 
directly implemented by manipulating the control param- 
eters in the Hamiltonian. We have verified this by imple- 
menting Grover's algorithm which picks out the "file" 
1 10) starting from the pseudopure state generated by 
cyclic permutations of the state populations. We ob- 
tained the time-optimal pulse sequences and compared 
the results with those obtained by the conventional pulse 
sequences. It turns out that the gate f/ 10 is already opti- 
mized in the conventional pulse sequence while the gates 
UioU cp and U\qU^ v required for the cyclic permutations 
are accelerated so that the execution time is 25% of that 
for the conventional pulse sequence in both cases. The 
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Conventional pulse sequence 


Gate 


Pulse sequence 


Execution time 


u 10 


1 

2 


-Y -(1/2J)-Ym-Xm-(1/2J)-Ym-Xm- 
-Y -(1/2J)-Ym-X -(1/2J)-Ym-Xm- 


1/J 


UioUcp 


1 

2 


-X -(1/2J)-X Y -(1/2J)-Ym-Xm-(1/2J)-Ym-Xm- 

X -(1/2J)-X -Y -(1/2J)-Ym-X -(1/2J)-Ym-Xm- 


2/J 




1 

2 


x -(i/2J)-X -Y -(1/2J)-Ym-Xm-(1/2J)-Ym-Xm- 

-X -(1/2J)-X Y -(1/2J)-Ym-X -(1/2J)-Ym-Xm- 


2/J 


Time-optimal pulse sequence 


Gate 


Pulse sequence 


Execution time 


u w 


1 

2 


-X -(1/2J)-Xm-Ym-(1/2J)-Y -Pi(45)- 
-X -(1/2J)-Xm-Y -(1/2J)-X -Ym 


1/J 


UwUcp 


1 

2 


-X -(1/2J)-Xm-Ym- 
-Ym-Ym- 


1/2J 


UioUcp 


1 

2 


-Y -X -(1/2J)-Xm- 


1/2J 



TABLE I: Control pulse sequences for Grover's algorithm, which picks out the 'file' 1 10) starting from the pseudopure state 
1 00), which is obtained by cyclic permutations. The carbon nucleus is the first qubit while the hydrogen nucleus is the second. 
The upper Table shows the conventional pulse sequences [l5| required to execute Grover's algorithm with the pseudopure state. 
The uppermost row shows the pulse sequence for the gate Uio, while the second and the third rows show those for UioUcp and 
UioU? p , respectively. Here X (Xm) and Y (Ym) denote 7r/2 pulse along x (—x) and y (— y) axis, respectively. The lower Table 
shows the time-optimal pulse sequences. The symbol Pi (45) denotes 7r-pulse along (1, 1, 0) direction of the Bloch sphere. 



number of the pulses required for UioU cp (ioU^ p ) is 4 (3) 
in the time-optimal pulses sequence, while it is 14 in both 
cases if the conventional pulse sequences are employed. 
The smallness in the number of pulses required leads to 
a higher-quality spectrum. 
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